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1. Introduction 

The main goal of statistical mechanics, at least from the point of view of the 
initiators like Boltzmann, Maxwell, Gibbs^, Einstein, was the derivation of the ther- 
modynamical laws from the microscopic (atomistic) structure of matter. All these 
attempts are subjected to start from some models of the structure of matter. But 
it is a well known fact that thermodynamics was constructed independently or at 
least following a parallel road on the basis of few foundamental laws that are viewed 
as empirical facts. Very recently, Lieb and Yngvason tried to clarify some aspects of 
the second law (entropy) of thermodynamics on the basis of the concept of adiabatic 
accessibility. This work was mainly motivated by the fact that usual formulations 



^Gibbs had a pragmatic point of view which was somehow different from Boltzmann's view. 
Gibbs founded statistical mechanics as a branch of rational mechanics, no matter what physical 
process generates the distribution in phase space. Contrary to that, Boltzmann's view-point was to 
really reduce thermodynamics to mechanics and consequently he necessitated an explanation of the 
mechanism that lead to equilibrium a mechanical system which was initially in a nonequilibrium 
state. To skech the differences one can say that the Boltzmann approach is more physical whereas 
the Gibbsian is more rigourous. 
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of the second law, such as Kelvin or Clausius, use concepts such as hot, cold or 
heat that are intuitive but not really well defined nor precise before the theory is 
fully developed. Their basic derivation of the second law (that is the existence of 
the entropy state function) is based on some abstract postulates of a certain kind of 
ordering on a set of states. 

From our point of view, the problem of the foundations of statistical mechanics 
is two-fold. One is: given a statistical theory, one has to extract quantities (averages 
of phase functions) and laws that can be identified with thermodynamical quantities 
and fundamental laws. The identification itself being of analogy type.^ Once those 
fundamental laws are recognized, one can logically develop the entire consequences of 
these laws. This logical enterprise was perfectly achieved by Gibbs in his celebrated 
treaty[3]. The other, less easy task, is to justify the use of the statistical theory 
(precepts) itself from a realistic^ point of view, that is, so to speak to justify the very 
use of ensembles. Differently stated, why canonical or microcanonical ensembles are 
suitable to describe real physical systems ? This question arises since it is generally 
believed that the system to be studied is in a definite state and not distributed over 
a continuum of states. It is clear that this second task is more physically related 
to the very structure of matter, and it is within this perspective, that the work of 
Boltzmann has to be viewed. The (partial) answer to this question is related to the 
fact that real thermodynamical systems are constituted, at least approximately, of a 
huge collection of particles. The discussion of these points will be largely developed 
in the next section. Section 3 deals with non-equilibrium aspects and we present 
their relations, such as the fiuctuation theorem and Jarzynski equality, which seem 
to many physicists to be of fundamental interest. In the last section we present some 
results obtained on the Ising model in the fluctuation relation context. 

2. Foundations of statistical mechanics 

2.1. Interpretation of physical quantities 

The state of our system (classically a point in the phase space or a Hilbert spec- 
tral ray quantum mechanically) fully determines the physical (dynamical) quantities 
which caracterize the given system. We will generally call such a quantity a phase 
function (classical case f{q,p), quantum case f{tp) = {ip, Qtp) where Q is the opera- 
tor associated to the quantity /). In order to have a suitable theoretical description, 
one has to identify such phase functions with the various physical quantities ob- 
tained experimentally from measurement processes and compare their respective 
value. However, in order to compare the empirical data with the theoretical predic- 
tions, one has to know the actual state of the system, that is, for example classically, 
to determine 2s (~ 10^^) coordinates. But in general, the empirical (macroscopic) 
description of what is called a (equilibrium) thermodynamical state is fully specifled 

^As it is very explicitly emphasised in Gibbs treaty [3] 

^ Given that the basic ontology is a single mechanical system composed of many subsystems 
(particles). 
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by a very small set of independant variables, such as the energy, volume, pressure, 
...So that the question that rises is which state should we choose in order to eval- 
uate the relevant phase functions and compare their value with their experimental 
counterpart ? and obviously no one has or can have any reasonable answer to such 
a question. Nevertheless, if one realizes that the measurement of a physical quantity 
is performed during a finite time, which in general is very large compared to some 
internal time scale, one realizes that the actual empirical datas are given as averages 
of the quantities over long time periods. But the initial question still survives, that is, 
which (part of the)^ trajectory the system is actually following ? In order to answer 
such a question one has to know 2s — 1 independant integrals of motion and it seems 
that a very small path has been done toward the solution since our starting puzzling 
problem of finding the 2s coordinates. At this step, as it is well known, in order to 
avoid the average over an unknown trajectory, normally one invoques ergodic theo- 
rems or hypothesis to replace time averages by phase averages. However, in general 
very few systems are known to be ergodic and it seems really unprobable that in 
a realistic case one will ever prove ergodicity. But the requirement of ergodicity is 
too strong. For instance one has simply to require that only few (corresponding to 
the empirical ones) phase functions should have time average equal to their phase 
average. So, it would be an unnecessary hypothesis to demand the validity of such 
an equality for all phase functions. Another objection that has to be emphasized 
is the fact that ergodicity is a requirement that involves average over recurrence 
time, which is too long to have any physical relevance (several astronomical orders 
of magnitude), but in actual experiment, the times involved to obtain the averages 
are by far shorter than the reccurence time, see for example for a discussion of 
this point Ref. [6]. The reason of this is lying in the fact that the majority of phase 
functions describing physical quantities exhibit a very peculiar behaviour. They arc 
approximately constant on almost all the points of the constant energy manifold 
(since we are talking here of an isolated system). Why is it so is linked to the fact 
that the mechanical systems, considered here, are breaking up into a large number 
of components and the fact that the interesting phase functions are sum functions, 
that is sum of functions depending on the dynamical coordinates of the component 
subsystems alone. 

If we suppose, for some reasons, that we can replace time-averages by phase- 
averages, then the remaining problem is to determine the suitable phase average 
procedure. In the case of an isolated mechanical system, it is usually argued that 
one has to restrict the phase average to the constant energy manifold since the actual 
trajectory is taking place into this subset of the phase space. Indeed, if one consider 

^One problem that arises is the fact that time average of a phase function on a given trajectory 
may have very different values for different time intervals. This difficulty is overcome thanks to 
a theorem due to Birkhoff, which states that for almost all trajectories, the time averages of a 
given phase function tend to a definite limit when the time interval tends to infinity. It means in 
particular that the averages over finite time intervals on a given trajectory (a typical one) will take 
approximately the same value for sufficiently large time-periods. This remark, basically, is at the 
heart of the time average procedure used widely to start an exposition of statistical mechanics, see 
for example ref. [5]. 
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the energy phase function for such an isolated system and one takes its phase average 
over several constant energy manifolds, it is clear that the average value will not 
give the actual energy value. To overcome this discrepency, one has to restrict the 
phase average on that constant energy subset corresponding to the real value. The 
argument is clear and easy to conceive. But one has to realize that every thing 
that has been said concerning the energy should also be true for the 2s — 2 other 
integrals of motion. So in particular, if we consider some other conserved quantity 
to be given as the energy is given, we have to restrict the average procedure to 
the intersection of the two corresponding manifolds, where the actual trajectory is 
taking place. Continuing on this lines, wc will finally arrive at the fact that we have 
to average over the intersection of the 2s — 1 conserved quantities manifolds, that 
is over the actual trajectory and this is precisely what wc wanted to avoid. In order 
to escape this vicious circle, let us consider a certain integral of motion defined 
by a phase function I. If I over the constant energy manifold takes approximately 
always the same value, then its time average over almost all trajectories will give 
almost the same (physical) value. Consequently, its phase average on the constant 
energy manifold will give a definite value which can be compared to the actual 
physical value. On the other hand, if this is not true and the phase function / varies 
widely on the constant energy manifold, no definite average for different trajectories 
can be affected to it and it will not have a (macroscopic) physical interpretation. 
However, if such a phase function has indeed a physical interpretation (and so an 
actual possibility of measuring it), one has to treat it on exactly the same footing 
as the energy is treated, and consequently if the value of that integral is known, we 
should restrict the phase average to the corresponding manifold of constant energy 
and constant /. It is a fact that in ordinary macroscopic systems, usually only the 
energy integral has to be considered.^ 

2.2. Microcanonical principle 

As we have seen from the arguments given above, one of the goal of a suit- 
able statistical theory is to give for some phase functions the same values as those 
obtained experimentally and compatible with phenomenological thermodynamics. 
But, as it was argued, the values of the phase functions over a subset of the phase 
space present generally fluctuations that could be very large. In the following we 
give some more arguments in the direction of a phase space average. 

A thermodynamical state is specified completly by a small set of quantities 

Qi,Q2,...,Qfe . (1) 

^One can refer to [5] for some more arguments relating the question to additive integrals of 
motion, which arc explicitely, the energy, the momentum and angular momentum components, 
that is integrals that are related to the space-time symmetries. If one encloses the system into 
a rigid vessel at rest, translation and rotation invariance are broken and only the energy integral 
remains. The additivity argument relics on the fact that the equilibrium statistical distribution over 
the restricted phase space should have a product structure related to the statistical independence 
of the different components of the system, that is related to the small interaction between the 
different components. 
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This means in particular that all physical quantities B are given as functions of 
these variables: 

B = fB{QuQ2,..;Qk) ■ (2) 

To be more precise, there is one such function, named the fundamental relation: 

^ = MQi,Q2,...,Qfc) (3) 

from which all the other quantities can be obtained by appropriate derivations. This 
is basically thermodynamics, at least if one specify some properties of that funda- 
mental function (second principle) . [8] On the microscopic side, it is clear that the 
specification of this very small set of quantities is not at all sufficient to completely 
determine, from the microscopic point of view, the state of the system. In general, for 
a given set {Qi, Q2, Qk) we have many compatible microscopic states (let us call 
them as it is usual microstate). To be more specific, wc will continue the discussion 
in the quantum case. So for the given set {Qi, Q2, Qk) we have some set^ 

V'l,'02,-,'0i,- 

of microstates to which are associated the corresponding values 

El, B2, -Bj, ... 

of the phase function such that Bi = fB{i>i)- 

Let us imagine that we had prepared a collection of N copies partially specified by 
the set Q. Measuring on each copy the quantity B we obtain with some frequency 

Ui/N the corresponding value Bi. If we want to compare some theoretical value 
associated to the macrostatc specified by the set Q, we have to consider the average 
of the quantity B over our experimental data. This is given by 

{B) = ^ (mSi + 712^2 + ... + riiBi + ...) . (4) 

The values Bi are in principle known since they are quantum mechanical expecta- 
tions over the states ■?/';• The real problem comes from the fact that no theory can 
give the values of the frequencies rii/N since they are related to the actual experi- 
mental setup. Different devices, preparation protocols in the experimental setup will 
lead to different occurences of the microstates and so of the values oi B. To solve the 
difficulty one could think of stating an average principle, that is of specifying the set 
{rii/N}. But again, in general different average principles will lead to different (B). 
In order to reconcile this observation with the uniqueness of empirical observations, 
one is naturally lead to the fact that almost all values B^ should take almost the 
same value: 

Bi (B) = S^'^^^ 

^Usually the set has the power of the continuum, so that the integer indexation used here should 
no be taken too seriously. 
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The measure of the set where this equahty doesn't hold should vanish or be rea- 
sonnably small. By calling upon the argument of simplicity, in order to calculate 
the average (B), we can choose the microcanonical average principle, that is speci- 
fying equal weights to all microstates. One has to realize here that from the given 
argument the microcanonical average principle is not unique. But, nevertheless, one 
can argue that, given all the previous (more or less of euristic type) arguments and 
the fact that the microcanonical principle generates the same macroscopic relations 
as thermodynamics does, it is legitimate to postulate it by the Laplacien "principle 
of insufficient reason".^ According to an other guideline [7], one can try to base the 
foundations on the ground set by information theory, that is to have a constructive 
criterion (maximum-entropy principle) for selecting a probability distribution on the 
basis of partial knowledge. In the case of constant energy systems, the maximum- 
entropy principle leads to the microcanonical distribution. To end this discussion, 
one can state again that not all phase functions should satisfy the average princi- 
ple requirements, only those having a macroscopic thermodynamical interpretation. 
Indeed, no one is expecting that the averaged (microcanonical or whatever) one 
particle velocity of a gaz should give the actual measured velocity of a particule of 
that gaz. 

Let us consider for an isolate system the set of all the quantum states for which 
the energy is precisely fixed to the value E. These states are statiomiary states, that 
is eigenstates of the Hamiltonian operator H with eigenvalue E. The dimension of 
this Hilbert sub-space Tig is given by the degeneracy of the eigenvalue E: 

dimTiE ^ ^{E) ^ m . (5) 

Let ipi, ...jipm be a complet orthonormal set of eigenvectors of H spanning the entire 
subset He- One has for all ^ e He the unique decomposition 

m 

*-E«^^^ (6) 
1=1 

with the scalar coefficients a sitting on the complex hypersphere S* 

m 

EKI' = i (7) 

i=l 

The microcanonical average of the phase function /q(^) = (^, Q^) is given by the 
integral over the complex hypersphere with uniforme measure. One has 

(Q) = (/q)= fS^M'^)dS*. (8) 

Together with the decomposition on the base ipi, ipm and with the parametrization 
(^k = Tke"^'^ , one arrives at 

(Q> = E(V'^, QV'fe) / o^ka^dS* (9) 

k,i 

^The principle of insufBcient reason or principle of indifference, states that if there is no known 
reason for predicating of our subject one rather than another of several alternatives, then relatively 
to such knowledge the assertions of each of these alternatives have an equal probability. 
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{Q) = Y.i^iM^k)-h,i . (10) 
k,i ^ 

so that finally 

«) = s^i:(V'^.Q« = rr«,{jj^Q}. (11) 

Due to the invariance of the trace with respect to the change of orthonormal bases, 
the average (Q) is also invariant. This final rule is the basic starting point of a 
microcanonical calculation.^ 



2.3. Suitability of the microcanonical principle 

In order that the microcanonical average of a quantity Q has a physical signifi- 
cation, it is necessary that it takes a value close to the real experimental value or 
to the value given by a real dynamical theory. As it would be shown bellow, this 
requirement would be fulfilled by sum functions for which the average {Q) is of the 
order of the number of components N of the system, that is 

(Q) = Nq (12) 

where q rcfcrcs to the one component quantity (a density like value). To show the 
proposition, let us consider the measure of the set of the vector states "if E He 
such that the probability 

P^{\Q - Nq\ > Ne) > 6 (13) 

that the quantity Q differ significantly (of the order eN) from the average value Nq 
is greater than S > 0. We have 

/ P^(\Q - Nq\ > Ne)dS* > SMg (14) 

Js* 

from which, together with the Chebyshev inequality (x^) > e^P(|x| > e), we arrive 
at 

^^^^ 

that is with the previous result on microcanonical average 

M.<^T.„,{^lQ-iV,n = ^flW), (16) 

where D{Q) is the microcanonical dispersion of the quantity Q. Since in usual phys- 
ical situations the dispersion is growing linearly (which is the content of the law of 
large numbers) with the number of components N, one arrives at 



N 



is known that the state vectors should statisfy symmetry or antisymmetry principle for 
bosons and fermions. This imply that in the given microcanonical average, one has just to restrict 
the subspace He either symmetric subspace H% or antisymmetric subspace 
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so that in the thermodynamical hmit this ratio vanishes. This means that the quan- 
tity Q would approximately agree with the phase average with a probability arbitrar- 
ily close to unity and this finally demonstrates the suitability of the microcanonical 
phase space average. We see here that the main reason for the validity of such a 
principle is the fact that thermodynamical systems are build up from myriades of 
component subsystems and that actual relevant physical quantities are sum func- 
tions. [2] Moreover, it is possible to show that the result just obtained remains valid 
if one affects some arbitrary absolutely continuous probability law, with respect to 
the measure introduced on the constant energy manifold, to the occurence of the 
state ■0- 



2.4. Canonical distribution 

Up to now, wc have always considered isolated mechanical systems, but it is clear 
that such an idealization is not realistic nor efficient from a technical point of view. 
Indeed, a real system is never completely isolated and continuously interacts with 
its surounding. If the system is initialy prepared in a given (quantum) state, due to 
the interaction with its environment, it will very rapidly make transitions among its 
accessible states. Since such transitions are induced by purely random processes, for 
a sufficient time (it is claimed that) the system will sample out all the permissible 
states with equal probability. [8] 

An other idealization is one in which the system under interest is free to exchange 
an arbitrary amount of energy with a very huge surrounding, the total system plus 
surounding being isolated. If the surrounding has good enough properties (it is then 
called a thermal bath), it basically fixes the temperature of the small (in comparison 
with the bath) system. It is possible to show rigourously that if we accept the 
principle of microcanonical average, then the small system is distributed according 
to the canonical law with a density [1,9] 



where H is the hamiltonian of the small system and where the unique remain of the 
bath is the inverse temperature j3. To arrive at the canonical, much more tractable, 
description, one can also remark that since the sole role of the bath is to fix the 
temperature of the system, it doesn't matter of what it is actually composed. So 
that, one can perfectly imagine a huge (infinite) collection of components all identical 
in nature to the system under consideration, free to exchange energy. One is naturally 
led to the canonical law and to the concept of ensemble (at least in the canonical 
case). [3] Prom here, we will not continue further in the development of the logical 
consequences of the canonical distribution. The ambition of this first part was to 
clarify some aspects of equilibrium statistical mechanics. 
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3. Steps toward non-equilibrium 

As it is well known and strongly emphasized in the specialized literature, the 
non-equilibrium situation is not as developed as the equilibrium case is. In partic- 
ular, most physicists will agree to say that one cannot speak of a non-equilibrium 
statistical mechanics, in sharp contrast with equilibrium statistical theory. Never- 
theless, in recent years there have been several developments that have led to general 
results not restricted to the vicinity of the equilibrium regime (as linear response 
theory is). In this section we briefly present some aspects of that results, namely the 
fluctuation theorem and the Jarzynski equality. 

3.1. Jarzynski equality 

The Jarzinsky equality is an "unexpected" equality relating equilibrium quanti- 
ties with the average of a non equilibrium process that can be very far from equi- 
librium. [10] More specifically, if one takes a system that is initially in a state of 
equilibrium at inverse temperature (5 with an external (work) parameter denoted by 
A and then if, within a finite time (so that we drive the system out of equilibrium) , 
one tunes the external work parameter to the new value 5, we will perform on the 
system some amount of work W which is specific of the actual microstate of the 
system. If we repeat this experiment many times and record the values W then the 
Jarzynski equality states that 

(e-^^) = jm p{W)e-^'^ = e-^^^ (19) 

where p{W) is the distribution of work within the given protocol (how we have tuned 
the work parameter) and AF = Fb — Fa is the free energy difference between the 
equilibrium states at temperature j3~^ with respectively external parameters B and 
A. In order to demonstrate the Jarzynski equality, we will follow the fines developed 
in ref.[ll], and present the case of a classical Hamiltonian dynamical system. But it 
is necessary to mention here that such an equality was also derived in tfie quantum 
case[12-14] and witfiin a stocfiastic markovian dynamics too[10,15]. Tfie Jarzynski 
equality was soon tested experimentally. One can see ref.[16] for a recent review. 

Tfie starting point of tfie demonstration given in ref . [1 1] is to consider tfie fiamil- 
tonian of tfie system and its tfiermal environment, togetfier witfi an interaction term: 

n{V- A) = Hs{x, A) + Heiy) + Uratix, y) (20) 

wfiere tfie subscript s(e) refers to tfie system(environment) fiamiltonian witfi collec- 
tive dynamical variables represented hy x{y).V = {x,y) is tfie pfiase space coordi- 
nate of tfie full system plus environment dynamical system. Tfie interaction term 
T~(-int{x-i y) is supposed to be small enougfi in order tfiat one can interpret Tis as tfie 
internal energy of tfie system of interest. Let us initially consider tfiat tfie system 
and environment are described by tfie tfiermal equilibrium Gibbs state witfi work 
parameter X = A 

pir) = -^expi-pniT;A)) (21) 
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where Z[A) is the normahzation factor. When we vary the work parameter from the 
initial value A to the final value B within a time r and a predefined protocol, the 
energy change of the system over the given microscopic trajectory {r}t is given by 

Hsixr] B) - Hsixo; A) = £ dt A^(xt; A^) 

where the first integral is interpreted as the work W performed on the system and 
so the second integral is the heat absorbed during the process. One may notice that, 
since the dynamics is hamiltonian, the work W is given by the total energy difference 

W = n{Tr;B)-n{To;A) . (23) 

To compute the average (e~^^) over the initial Gibbs state, one can use for the 
work the previous expression and write 

(e-^^) = J dTo p(ro)e-^[^(r-^)-^(r°'^)l (24) 

and using the expression (21), one arrives at 

(e-'"'') = ^ /dr„ e-'"'" -«> . (25) 



Now, by the canonical change of variables Fq Tt, and using Liouville theorem on 
the invariance of the measure under canonical transformations, one finally arrives 
at 

Z{B) 

Z{A) J z{A) 

This last relation is looking like the Jarzynski equality (19) but the partition func- 
tions entering here are that of the full system and environment. Nevertheless, one 
can arrive at the Jarzynski equality (19) simply by noting that if we are able to 
neglect the interaction term (which has to be small enough), then the partition 
functions factorize into an environment term independant of the work parameter 
and a system term depending on A. The ratio Z[B)/Z[A) can then be rewritten as 
the Jarzynski equality 

^Z{B)_^ Jdx e-^^-(-^^) ^ UB)_ ^ 
^ ' Z{A)^ Idxe-f^-H^^^'^) Z,{A) ^ ' 

where this last form referes only to quantities pertaining to the system of interest. 

Let us somehow discuss on a physical ground the Jarzynski equality. Indeed, if 
we perform on the system a reversible process by varying slowly enough the work 
parameter A from A to B, then it is clear that 
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where W is the thermodynamical work performed on the system. If the switching 
is faster enough such that the system has no time to equihbrate with the new work 
parameter, then the second law of thermodynamics states that the work W has to 
be larger so that one has for a general process the least work principle: 

W >AF (28) 

which can be written 

Wdis ^W-AF>0 (29) 

where the equality holds in the reversible case. If we rewrite the Jarzynski equality 
with Wdis — W — AF, we have simply 

(exp(-/3iyd,,)) = l. (30) 

Using Jensen's relation (exp(a;)) > exp((a;)), we see that the Jarzynski equality 
implies 

{Wdis) > (31) 

which is the least work principle if one identifies the average work (W) with the 
thermodynamical work W. From this, roughly speaking, one sees that if cr is a 
parameter that controls irreversibility (one may think of the entropy production), 
the density distribution Pa{W) 

lim p^(W) ^ 5(W - AF) 

(7 — ^0 

in the reversible case a = 0. Moreover, in the linear response regime, one expects 
gaussian fluctuations of the work W such that the density takes the form 

near equilibrium, where 9 — {{W — (W))"^) is the variance of the distribution and 
Wo = (W) the mean value of the work. Using this expression together with the 
Jarzynski equality, one can relate the fluctuation of the work to the dissipated work 
{Wdis), or to the entropy production if one notes that Ta = {Wdis) = TAS — Q ~ 
{W) - AF: 

{Wd^s) = \(5e , 

or equivalently 

a ^ kB\{{w - {w)f) 

with w = W/ksT. We recover here a fluctuation dissipation theorem, relating the 
dissipated work to the fluctuation of it. Within this reduced work variable w, the 
density of the reduced dissipated work Wd is rewritten 
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where the Boltzmann constant has been absorbed into the definition of the 
entropy production. In the reversible hmit, one has 

cr— >0 

that is no dissipation at alL 

The very interesting feature of the Jarzynski equahty is that it provides a method 
to recover free-energy differences from non-equihbrium experiments. This has been 
done recently for exemple on single molecule stretching experiments [16-1 8]. The 
possibility of recording free-energy differences was also experienced on simple theo- 
retical models as one can see [19-21] for some examples and [16,22] for recent reviews 
on the experimental and theoretical aspects respectively. 



3.2. Fluctuation theorem 

Soon after the derivation of the Jarzynski relation it was demonstrated [35] that 
such result can be derived from a much more general theorem, namely the fluctu- 
ation relation which is a statement on the fluctuations of the entropy production. 
The flrst example of such a relation was obtained numerically in ref.[23] for steady 
states systems and was soon after derived for driven thermostated deterministic sys- 
tems[25], thermostated steady state systems with deterministic [26] and stochastic 
dynamics[27-29]. The theorem can be written in the following form: 

^^'^'^^ i exp(T(7i-) (32) 



where -P(cTr) is the probability of having the entropy production rate cr^ over a time 
T. This relation is in fact an asymptotic statement which is valid in the r — > cxd limit. 
The precise statement of the theorem could be enonced in the following way: [6,22] 
Let (j(r) be the entropy production rate in the stationary non-equilibrium state of 
the dynamical system represented at the initial time by the phase space point F. 
Let us deflne the "dimensionless average entropy creation rate" 



a+T Jo 



where (7+ > is the average entropy creation rate in the steady state and where the 
-|- subscript emphasizes the positivity of it. The fluctuation theorem states that 

C(P) = C(-P)+P^+ , (33) 
where C(p) is the large deviation function of p deflned as 

({p) = lim — ln7rT-(p) 

with TTrip) the probability distribution of the variable p. The vahdity of the fluctu- 
ation theorem is very broad since it was proved within several different dynamical 
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contexts, such as reversible hyperbolic dynamical systems [26] or nondeterministic 
dynamics [27,28,25]. It was proved also that the fluctuation theorem in the limit of a 
vanishing entropy production rate implies the linear (near-equilibrium) fluctuation- 
dissipation theorem [30,31,27] . The theorem was also tested in a number of numerical 
investigations [32] and recently on granular materials and turbulent flows [33]. We 
may mention also some recent extension for stochastic systems which are not able 
to equilibrate with their environment in the limit of zero external drive [34]. 

Whereas the fluctuation theorem is an asymptotic relation. Crooks derived a 
very interesting identity in the case of stochastic microscopically reversible dynamics 
[15,35]. This identity reads 

PfW n 



where O is the entropy production of the system driven for some time r within a 
forward protocol Ai?(t) to a new state. Pp is the distribution of entropy production 
in the forward process, while Pr is the distribution of the entropy production in the 
backward or reverse process, that is when the system is driven in a time-reversed 
manner. 

A particularly intersting case is one in which the system is initially in contact with 
a thermal heat bath. The forward protocol is such that the system being initially (by 
convention we will take the far past t — — oo to label the initial time) at equilibrium 
with an external parameter A_, is driven out of equilibrium by varying the control 
parameter to a new value A+. The variation of A takes a finite time r, let us say 
within the interval [0, r]. For times larger than r, the system is equilibrating with the 
heat bath and terminates in an equilibrium state characterized by the new external 
parameter value A+. Since the equilibrium entropy is given by 5" = (— Inp(r)) = 
— p(r) In p(r), it is quite natural to identify 

s{T) = -\np{T) 

as the entropy of the microstate V. The entropy production for a given trajectory in 
phase space is then 

Q = In p(r_oo) - In p(roo) - (35) 

where p{T±^) = p{T, A±) are the canonical equilibrium densities associated to the 
initial and final equilibrium states and where Q is the heat exchanged during the 
process with the heat bath. Introducing into the previous expression the explicit 
equilibrium density 

p(r,A) = e'3^W-'^^(r'^), 
one arrives, with AE — W + Q, for the entropy production at 

Q = f3[W - AF] . 

So that the Crooks fluctuation identity can be written as a relation for the work 
performed on the system: 

Pf{W) _ ^fs^w-AF) 



Pr(-W) 



= e^^'^-^^^ . (36) 
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From this last expression, one can see that the Jarzynski identy is trivially recovered 
since 

where the normalisation condition of Pr has been used. 

An interesting point is when one considers a cyclic transformation. In this case 
the final state is identical to the initial state and AF = so that the entropy 

production is given by —Q/T. Utilising again Jensen's inequality we obtain that in 
average the entropy production is positive, which means that since work is performed 
on the system, heat is tranfered to the bath (and not the converse which is Clausius 
basic statement). 

4. Work on the 2(i-lsing model 

In this paper, we discussed aspects of equilibrium statistical mechanics and fo- 
cused some attention on non-equilibrium fluctuation relations. In order to give a con- 
cret example in the out of equilibrium situation, we give some preleminary results for 
work distributions obtained on the 2(i-Ising model [36]. The zero field Hamiltonian 
is given by 

SiSj 

iij) 

where the S — ^1 are classical Ising variables and where the sum extends over 
nearest neighbors. In the numerical simulations, wc use a Metropolis dynamics. The 
protocol we use is such that at the intial time t = 0, the system beeing in an 
equilibrium state at inverse temperature /5 and zero field, we swich on the field for 
a time r with a linear law: 

h{t) = Ht = —t . 

T 

For each initial realization, we compute the work W performed on the system by 
the external field with 

W^-H r dt M(t) 
Jo 

where M{t) — J2i Si{t) is the instantaneous magnetization of the system. 

We first show on figure 1 the work distribution obtained with an initial para- 
magnetic state for three different protocols with ending field h = 1.5 reached after 
time r = 25, r = 50 and r = 100, that is smoother and smoother protocols. As it 
can be seen on figure 1, the distributions are gaussian and behave as predicted in 
the previous section, with a dissipated work proportional to the variance of the dis- 
tribution. From the shift to the left (negative value), one can extract the free energy 
difference, since as r is increasing, the transformation is less and less irreversible 
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Figure 1. Work probablility density distribution of the 2d Ising model at /5 = 
0.2 (paramagnetic initial phase) with a linear varying field from /3/i(0) = to 
/3/i(t) = 0.3 for several time intervals r. The system size is L x L = 128^. 

(the entropy production is smaller and smaller). Since the distribution is gaussian 
(near equilibrium like regime), the free energy difference is given by 

AF = {w) - 

where 6 is the variance of the distribution. We have verified here that the Jarzynski 
equality holds, since the previous relation is satisfied and gives the same value AF 
for different r values. Moreover, from our data we find a linear dependence of the 
dissipated work with the perturbing speed H, as it is expected near equilibrium or 
more generally in the gaussian case [16], with 

{Wd^s) ^ a{H)L^H 

with a field dependent slope a{H). From the analyse of the work distributions for 
several ending fields, we arrived numerically at an expression for a{H) which is very 
well fitted by: 

a{H) = -ff^{H) = -ffitanh{f3fxH) 

with 7 ~ 0.43 and fi{(3 = 0.2) ~ 3.65, where /u(iJ) is the paramagnetic magnetiza- 
tion. Indeed, if one extracts the free energy difference as a function of the field H, 
one obtains a very good fit with the paramagnetic solution 



with a magnetic moment fi ~ 3.65 and volume k ~ 4.6. If one uses the equilibrium 
magnetization extracted from that expression: 

M{H,(3) = -— = — /itanh(/5/i//) , 
oH K 
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the dissipated work can be expressed as 

{Wd,s)^2HM{H,(3) 

so that it depends only on the equihbrium quantities of the paramagnetic gaz. 

If we decrease the temperature of the initial equilibrium state, at a moment we 
will arrive at the critical point. Of course, since our system is finite, the correlation 
length will not diverge since it will reach the boundaries of the system. Due to 
the long range critical correlations, one expects a different behaviour of the work 
performed on the system along the same linear protocol as in the paramagnetic 
phase. Indeed, it is the it can be seen on figure 2 where the work distribution 

is plotted for two ending fields, using three different protocols with r = 25, r = 50 
and r = 100. Insteed of having one pronouced peak as in the paramagnetic case, we 
have to face with two peaks distributions. 




Figure 2. Work probablility density distribution of the 2d Ising model at [3c 
(critical initial phase) with ending fields f3ch = 0.1 and f3ch = 0.3 for several time 
intervals r. The system size is L x L = 128^. The distribution with j3ch = 0.1 has 
been shifted vertically by an amount of 5.10~^. 

What we see is that the left most peak, sitting near by the reversible work AF, 
is moving very slitly toward negative values for larger time intervals r. The right 
most peak has a strong dependance with the perturbing speed H as it is obvious 
from figure 2 and seems to converge asymptotically toward the reversible value AF. 
We can understand the structure of the distribution in the following way: since our 
system is finite, even at the critical temperature /?~^, there is a finite magnetization 
of the order L'^"^/'^ . When turning on the field, if the initial magnetization is pointing 
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in the direction of the field, we expect a negative work of the order —HM, where M 
is the initial magnetization. On the contrary, if the initial magnetization is pointing 
in the opposite direction, at least for very fast protocols, one will have a positif work 
+HM since the system will not have enough time to react to the variation of the 
field. For lower field speed H, during the time interval r, part of the magnetization 
will flip in the field direction, leading to a smaller work value. It is clear that the 
real situation is more complicated than the rougth description just sketched above. 
In fact, one has to take into account the domain growth, since it will be the leading 
process for the variation of the total magnetization, and so for the work performed 
on the system. This study is currently under investigation. 

Let us finally discuss the ferromagnetic situation. In figure 3, we have presented 
the work distribution, for the three field speeds H, obtained with an ending field 
Ph = 0.1, with P = 0.7. What is seen is two very well separeted peaks, one sitting 
at a negative value W- and the other at a positive value W+ of the same order of 
magnitude. Moreover, each peak has the same weigth 1/2. For clarity, the right most 
peak has been plotted in an insert. The peaks are more or less gaussian like (in fact 

0.5 I ' , ' 1 
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Figure 3. Work probablility density distribution of the 2d Ising model at P = 0.7 
(ferromagnetic initial phase) with ending field (3h = 0.1 for several time intervals 
r. The system size is L x L = 128^. 

the right peak is a little bit asymmetric and so deviates from the gaussian behaviour) 
and one can see that as the process gets slower, the peaks gets sharper. What is also 
manifest is the fact that the left peak is almost no more translated to the left as in the 
paramagnetic case. Indeed, it corresponds to the initial equilibrium magnetization 
pointing in the direction of the field, and since at /3 = 0.7, the magnetization is nearly 
saturated, there is no magnetization gain to be expected during the swiching of the 
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field. The work is then of the order of —\HMeq{j3^ 0)|. The right peak corresponds to 
a microstate with an initial magnetization pointing in the opposite direction of the 
field. In this case, if the field is not strong enough, the macroscopic magnetization 
will stay unalterated by the presence of the opposite field and we will have more or 
less W+ ~ -\-\HMf.q{l3,Qi)\. In the slow swiching hmit, the work distribution takes 
the form 

P{W) = \5w. + \5w^ 
and so the average work (1^) is given by 



2 

while, using Jarzynski equality, the free energy difference AF is such that 

- 2 

where the argument of the exponential which is positive dominates largely the other 
exponential. Prom that, we have 

AF ~ -\W4 + ^ln2 ~ -\W4 

y 

where In 2//? has been neglected since the work is extensive. Together with the 
previous expression, we have for the dissipated work 

{Wais) = {W) - AF ~ + ^ " ~ -AF 

that is, the dissipated work is of the order of the reversible work (in magnitude). 
This striking result is due to the fact that almost all the irreversibility is putted 
in by the reversed magnetized microstatcs. If one uses as a macrostatc only those 
microstates with the magnetization pointing in the field direction, the situation 
would have been more usual, that is gaussian with small dissipation. Prom the data 
presented here, the ratio of the dissipated work with the free energy is less than 
5.10"^, which explains why there is no apparent shift toward the left of the curves 
for decreasing perturbation speed H. 

We close here the discussion of these numerical preliminary results. The point 
was to give an example where we can extract some equilibrium quantities within 
non-equilibrium (computer) experiments. However, one may notice that in order 
to obtain accurate results for the free energy difference from the Jarzynski equal- 
ity, one has to sample out very efficiently the tails (at least in the left most part) 
of the distribution since we need to compute exponentials of extensive quantities 
(work). But nevertheless, this program seems to be realistic enough and is currently 
under progress. Pinally, one may think of realizing such situations experimentally, 
measuring nanoparticule's magnetization within a SQUID experiment [37]. 
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